Knowledge Bases in Debugging Parallel and Distributed Systems Based on Event Traces
Abstract
Systematic event trace analysis requires detailed knowledge of both the event trace analysis environment in use and the object system itself. This article presents three important knowledge bases which, when contained in event trace analysis environments, make it easier to obtain useful performance results for the system under investigation. ParaGraph, a dedicated event trace analysis environment, and SIMPLE, a universal event trace analysis environment, are examined with respect to the knowledge bases defined.
Similar resources
Combining Functional And Performance Debugging Of Parallel And Distributed Systems Based On Model-driven Monitoring
In order to program parallel and distributed systems efficiently, a systematic way of defining and understanding the complex behavior of process interactions in concurrent programs is needed. A proven method for understanding existing programs is event-driven monitoring which abstracts the program behavior to a sequence of events. These event traces are analyzed for debugging and tuning of the ...
Transparent System Call Based Performance Debugging for Cloud Computing
Problem diagnosis and debugging in concurrent environments such as the cloud and popular distributed systems frameworks have traditionally been hard problems. We explore an evaluation of a novel way of debugging distributed systems frameworks by using system calls. We focus on Google's MapReduce framework, which enables distributed, data-intensive, parallel applications by decomposing a massive...
Formal Modeling of Communication Traces
Development of distributed systems is complicated by the absence of reliable global timing, concurrency, and nondeterminism. To deal with these obstacles, log files are produced by an instrumented system, facilitating analysis, testing, and debugging. This paper presents a formal framework for the analysis of distributed system logs based on the event trace concept. A partially ordered trace of event...
Comparison, Replay, and Refinement of Communication Traces for Debugging Distributed Failures
An increasing number of companies build their business on distributed Web applications. Hosting providers have responded to that demand and made it easier to deploy systems that spread across multiple services. However, this trend has outpaced the development of adequate debugging tools and developers still have to rely on an improvised patchwork of symbolic debuggers and printf debugging to find fail...
Debugging and Testing Middleware with Aspect-Based Control-Flow and Causal Patterns
Many tasks that involve the dynamic manipulation of middleware and large-scale distributed applications, such as debugging and testing, require the monitoring of intricate relationships of execution events that trigger modifications to the executing system. Furthermore, events often are of interest only if they occur as part of specific execution traces and not all possible non-deterministic in...